Automatic classification of discourse markers on the basis of their co-occurrences
نویسنده
چکیده
A long-standing linguistic hypothesis asserts that the meanings of words are related to the contexts in which they appear (Miller and Charles 1991). This paper explores this hypothesis by showing that co-occurrences of discourse markers reflect the meanings of the discourse markers themselves. An experiment in classifying discourse markers by their semantic class, e.g. temporal or causal, was carried out, achieving an accuracy level of approximately 75%. Analysis shows that performance differs widely across classes: temporal and negative polarity markers are classified most accurately, while additive and hypothetical markers are classified poorly.
منابع مشابه
A New Document Embedding Method for News Classification
Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...
متن کاملContrasting the Automatic Identification of Two Discourse Markers in Multiparty Dialogues
The identification of occurrences of like and well that serve as discourse markers (DMs) is a classification problem which is studied here on a corpus of dialogue transcripts with more than 4,000 occurrences of each item. Decision trees using item-specific lexical, prosodic, positional and sociolinguistic features are trained using the C4.5 method. The results demonstrate improvement over past ...
متن کاملPragmatic Annotation of Discourse Markers in a Multilingual Parallel Corpus (Arabic- Spanish-English)
Discourse structure and coherence relations are one of the main inferential challenges addressed by computational pragmatics. The present study focuses on discourse markers as key elements in guiding the inferences of the statements in natural language. Through a rule-based approach for the automatic identification, classification and annotation of the discourse markers in a multilingual parall...
متن کاملThe Analysis of the Discourse Markers in the Narratives Elicited from Persian-speaking Children
Discourse markers (DMs) are linguistic elements that index different relations and coherence between units of talk. Most research on the development of these forms has focused on conversations rather than narratives. This article examines age and medium effects on use of various discourse markers in pre-school children. Fifteen normal Iranian monolingual children, male and female, participated ...
متن کاملMetadiscourse Markers: A Contrastive Study of Translated and Non-Translated Persuasive Texts
Metadiscourse features are those facets of a text, which make the organization of the text explicit, provide information about the writer's attitude toward the text content, and engage the reader in the interaction. This study interpreted metadiscourse markers in translated and non-translated persuasive texts. To this end, the researcher chose the translated versions of one of the leading newsp...
متن کامل